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pyrE gene sequence 

SEQ ID N0:1 

Pyre/niger Length: 1578 March 9, 2001 09:28 Type: N Check: 22 

1 GGGT TAATGT GAAGGCGTTA GTGGTAATGT ATATTAATGG TGAGATGGGC 

51 TTTGATTGGG TTTAATTGGA ATCTGTATAT TTTCAGATGG AGTCAACTTT 

101 TGAATGGCCA AT AT AT C C T C GGCGATACCG TCGGAGATAA GATAAGAATA 

151 ATCGCACACT ATTCCCAAAG CATACTGGTA CATACTGCAT TCGGCTAGTG 

201 CGGGGTGCTT ACCTCATCCA CCCGAATGAG CCCAACTTTT TTGTCTCAAT 

2 51 CAATAATTGC ATCCAAATTC CCCCGCAACT TCCCCCTCCA ACCCCGTGTC 

3 01 TAT AC CACT C CCTCCACACC C ACACAAT C A CAATGGCTCT CCCTGCCTAC 
351 AAGACCGCCT TCCTGGAGTC TCTCGTCGGC CAACGTGCTG ACTTTCGGCA 
401 CCTTCACCCT GAAGTCGGGT CGCCGTGCGT CACCCCTCCA ACACCGGCAT 
451 TATCGCAATC GGAAGAC TTA CCACTGTATA CAGACTCCCC CTACTTCTTC 
501 AACGCCGGCA TCTTCAACAC CGCCTCTCTC CTCTCCGCCC TCTCCACCAT 
551 GGCCCACACC AT CAT CAC C T TCCTCGCTGA GAACCCTTCC ATCCCCAAGC 
601 CCGACGTCAT GCTTCGGGTA AAAAACCCCC TCTTTCCCCA ATACCCCACT 
651 TCCACTCAAC AAC C CAT AAA TAACTAACAA AAACCCCCTA AACAGCCCCG 
701 CATACAAAGG CATCCCCCTC GCGTGCGCCA CCCTCCTTGA ACTCAACCGC 
751 ATCGACCCCG CCACCTGGGG CAGCGTGTCC TACAGCTACA AC CGCAAAGA 
801 AGC CAAGGAT CACGGCGAAG GCGGCAACAT TGTCGGCGCC GCTCTGAAGG 
851 GCAAGACCGT GCTTGTGATC GACGATGTCA TCACGGCCGG TACCGCCATG 
901 CGTGAGACCC TCAACCTGGT CGCCAAGGAG GGCGGCAAGG TCGTCGGATT 
951 CACTGTTGCT CTGGACCGCT TGGAGAAGAT GCCCGGACCC AAGGACGAGA 

10 01 ACGGTGTCGA GGAC GATAAG CCCAGAATGA GTGCTATGGG TCAGATCCGT 

1051 AAGGAGTATG GTGTGCCCAC GACGAGTATT GTTACTCTGG ATGAT TTGAT 
1101 CAAGT TGATG CAGGCGAAGG GCAATGAGGC CGATATGAAG CGGTTGGAGG 
1151 AGTATAGGGC TAAGT AT C AG GCTAGTGATT AGTCGGTTTC ATTGACCGAT 
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1201 TGTTTGGGTG GGTGTGAGAG GTTAGGTTAG GTTGTGGGCG TAGGAATGAA 

1251 AAGCTGTATA CATAGGGGCC TGAAGAGGTG CGTAGAGACG GTCGTGAGAT 

13 01 GTTTTATGTC AAAATCTTGA ACAAATGACA CCTTAAAAAA GACCCCTTGG 

13 51 TTTCAGCTGA ATTAGCCCGG AAAGATGCTC GGCACGCCAT GAGTCTAGCC 

14 01 CACTCAGTGG GCACCCGTTT CCCACATTTG AAGTGGCCGA CGCTTATTTG 
1451 GCTGAGGCTG TGGCCTGGAA AGGCACTATG GCGTGCTGCG GTACAAGGCC 
1501 GGGGCTGGCG TACGAACCAC GACGCCCGAA GGGAACTCTT CGGTCTTACT 
1551 ACTACTATGT CCCCAGTTGA CCCCCCGA 



SEQ ID NO: 2 

Translation of pyrE (1-1578) 
Universal code 

1 GGGTTAATGTGAAGGCGTTAGTGGTAATGTATATTAATGGTGAGATGGGCTTTGATTGGG 
CCCAATTACACTTCCGCAATCACCATTACATATAATTACCACTCTACCCGAAACTAACCC 

1 GLM*RR*W*CILMVRWALIG 
1 G*CEGVSGNVY*W*DGL*LG 
1 VNVKALVVMYINGEMGFDWV 



6 1 TTTAATTGGAATCTGTATATTTTCAGATGGAGTCAACTTTTGAATGGCCAATATATCCTC 
AAATTAACCTTAGACATATAAAAGTCTACCTCAGTTGAAAACTTACCGGTTATATAGGAG 

21 FNWNLYI FRWSQLLNGQYI L 

21 LIGICIFSDGVNF*MANISS 
21 * LESVYFQMESTFEWPIYPR 



121 GGCGATACCGTCGGAGATAAGATAAGAATAATCGCACACTATTCCCAAAGCATACTGGTA 
CCGCTATGGCAGCCTCTATTCTATTCTTATTAGCGTGTGATAAGGGTTTCGTATGACCAT 

41 GDTVGDKIRI IAHYSQSILV 

41 AIPSEIR*E*SHTIPKAYWY 
41 RYRRR*DKNNRTLFPKHTGT 



181 CATACTGCATTCGGCTAGTGCGGGGTGCTTACCTCATCCACCCGAATGAGCCCAACTTTT 
GTATGACGTAAGCCGATCACGCCCCACGAATGGAGTAGGTGGGCTTACTCGGGTTGAAAA 

61 HTAFG*CGVLTSSTRMSPTF 
61 ILHSASAGCLPHPPE*AQLF 
61 YCIRLVRGAYLIHPNEPNFF 
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241 TTGTCTCAATCAATAATTGCATCCAAATTCCCCCGCAACTTCCCCCTCCAACCCCGTGTC 
AACAGAGTTAGTTATTAACGTAGGTTTAAGGGGGCGTTGAAGGGGGAGGTTGGGGCACAG 

81 LSQSIIASKFPRNFPLQPRV 
81 CLNQ*LHPNSPATSPSNPVS 
81 VSINNCIQIPPQLPPPTPCL 



3 01 TATACCACTCCCTCCACACCCACACAATCACAATGGCTCTCCCTGCCTACAAGACCGCCT 
ATATGGTGAGGGAGGTGTGGGTGTGTTAGTGTTACCGAGAGGGACGGATGTTCTGGCGGA 

101 YTTPSTPTQSQWLSLPTRPP 
101 I PLPPHPHNHNGS PCLQDRL 

101 YHSLHTHTIT M A L P A Y K T A F 

3 61 TCCTGGAGTCTCTCGTCGGCCAACGTGCTGACTTTCGGCACCTTCACCCTGAAGTCGGGT 
AGGACCTCAGAGAGCAGCCGGTTGCACGACTGAAAGCCGTGGAAGTGGGACTTCAGCCCA 
???????????????????????? 
121 SWSLSSANV LTFGTFTLKSG 

121 PGVSRRPTC*LSAPSP*SRV 
121 LESLVGQRADFRHLHPEVGS 

INTRON I 

421 CGCCGTGCGTCACCCCTCCAACACCGGCATTATCGCAATCGGAAGACTTACCACTGTATA 

GCGGCACGCAGTGGGGAGGTTGTGGCCGTAATAGCGTTAGCCTTCTGAATGGTGACATAT 

141 RRAS PLQHRHYRNRKTYHC I 

141 AVRHPSNTGI IAIGRLTTVY 

141 PCVTPPTPALSQSEDLPLYT 

481 CAGACTCCCCCTACTTCTTCAACGCCGGCATCTTCAACACCGCCTCTCTCCTCTCCGCCC 
GTCTGAGGGGGATGAAGAAGTTGCGGCCGTAGAAGTTGTGGCGGAGAGAGGAGAGGCGGG 

161 QTPPTSSTPASSTPPLSSPP 
161 RLPLLLQRRHLQHRLSPLRP 
161 D SPYFFNAGIFNTASLLSAL 

Ncol 

541 TCTCCACCATGGCCCACACCATCATCACCTTCCTCGCTGAGAACCCTTCCATCCCCAAGC 
AGAGGTGGTACCGGGTGTGGTAGTAGTGGAAGGAGCGACTCTTGGGAAGGTAGGGGTTCG 

181 SPPWPTPSSPSSLRTLPSPS 
181 LHHGPHHHHLPR* EPFHPQA 

181 STMAHTIITFLAENPSIPKP 
??????????? INTRON II 

601 CCGACGTCATGCTTCGGGTAAAAAACCCCCTCTTTCCCCAATACCCCACTTCCACTCAAC 

GGCTGCAGTACGAAGCCCATTTTTTGGGGGAGAAAGGGGTTATGGGGTGAAGGTGAGTTG 

201 P T S C F G *KTPSFPNTPLPLN 

201 RRHASGKKPPLSPIPHFHST 
201 D V M LRVKNPLFPQYPTSTQQ 
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661 AACCCATAAATAACTAACAAAAACCCCCTAAACAGCCCCGCATACAAAGGCATCCCCCTC 

TTGGGTATTTATTGATTGTTTTTGGGGGATTTGTCGGGGCGTATGTTTCCGTAGGGGGAG 

221 N P * ITNKNPLNS PAYKGIPL 

221 THK*LTKTP*TAPHTKASPS 
221 PINN*QKPPKQPRIQRHPPR 

72 1 GCGTGCGCCACCCTCCTTGAACTCAACCGCATCGACCCCGCCACCTGGGGCAGCGTGTCC 
CGCACGCGGTGGGAGGAACTTGAGTTGGCGTAGCTGGGGCGGTGGACCCCGTCGCACAGG 

241 ACATLLELNRIDPATWGSVS 
241 RAPPSLNSTASTPPPGAACP 
241 VRHPP*TQPHRPRHLGQRVL 

7 81 TACAGCTACAACCGCAAAGAAGCCAAGGATCACGGCGAAGGCGGCAACATTGTCGGCGCC 
ATGTCGATGTTGGCGTTTCTTCGGTTCCTAGTGCCGCTTCCGCCGTTGTAACAGCCGCGG 

261 YSYKfRKEAKDHGEGGNIVGA 
261 TATTAKKPRITAKAATLSAP 
261 QLQPQRSQGSRRRRQHCRRR 

Kpnl 

841 GCTCTGAAGGGCAAGACCGTGCTTGTGATCGACGATGTCATCACGGCCGGTACCGCCATG 
CGAGACTTCCCGTTCTGGCACGAACACTAGCTGCTACAGTAGTGCCGGCCATGGCGGTAC 

281 ALKGKTVLVI DDVI TAGTAM 

281 L*RARPCL*STMSSRPVPPC 
281 SEGQDRACDRRCHHGRYRHA 

901 CGTGAGACCCTCAACCTGGTCGCCAAGGAGGGCGGCAAGGTCGTCGGATTCACTGTTGCT 
GCACTCTGGGAGTTGGACCAGCGGTTCCTCCCGCCGTTCCAGCAGCCTAAGTGACAACGA 

301 RETLNLVAKEGGKVVGFTVA 
301 VRPSTWS PRRAARSSDSLLL 

301 *DPQPGRQGGRQGRRIHCCS 

9 61 C TGGAC CGCTTGGAGAAGATGC C CGGAC CCAAGGACGAGAACGGTGT CGAGGACGATAAG 

GACCTGGCGAACCTCTTCTACGGGCCTGGGTTCCTGCTCTTGCCACAGCTCCTGCTATTC 

321 LDRLEKMPGPKDENGVEDDK 
321 WTAWRRCPDPRTRTVSRTIS 
321 GPLGEDARTQGRERCRGR*A 

1021 CCCAGAATGAGTGCTATGGGTCAGATCCGTAAGGAGTATGGTGTGCCCACGACGAGTATT 
GGGTCTTACTCACGATACCCAGTCTAGGCATTCCTCATACCACACGGGTGCTGCTCATAA 

341 PRMSAMGQ I RKEYGVPTTS I 

341 PE *VLWVRSVRSMVCPRRVL 

341 QNECYGSDP*GVWCAHDEYC 
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10 81 GTTACTCTGGATGATTTGATCAAGTTGATGCAGGCGAAGGGCAATGAGGCCGATATGAAG 
CAATGAGACCTACTAAACTAGTTCAACTACGTCCGCTTCCCGTTACTCCGGCTATACTTC 

361 VTLDDLIKLMQAKGNEADMK 
3 61 ~~L L W M I * S S * C R R R A M R P I * s 

361 YSG*FDQVDAGEGQ*GRYEA 

1141 CGGTTGGAGGAGTATAGGGCTAAGTATCAGGCTAGTGATTAGTCGGTTTCATTGACCGAT 
GCCAACCTCCTCATATCCCGATTCATAGTCCGATCACTAATCAGCCAAAGTAACTGGCTA 

381 RLEEYRAKYQASD * S V S L T D 

381 GWRSIGLSIRLVISRFH*P1 
381 VGGV*G*VSG**LVGFIDRL 

1201 TGTTTGGGTGGGTGTGAGAGGTTAGGTTAGGTTGTGGGCGTAGGAATGAAAAGCTGTATA 
ACAAACCCACCCACACTCTCCAATCCAATCCAACACCCGCATCCTTACTTTTCGACATAT 

401 CLGGCERLG*VVGVGMKSCI 
401 VWVGVRG*VRLWA* E * KAVY 

401 FGWV* EVRLGCGRRNEKLYT 

12 61 CATAGGGGCCTGAAGAGGTGCGTAGAGACGGTCGTGAGATGTTTTATGTCAAAATCTTGA 

GTATCCCCGGACTTCTCCACGCATCTCTGCCAGCACTCTACAAAATACAGTTTTAGAACT 

421 HRGLKRCVETVVRCFMSKS* 
421 IGA*RGA*RRS *DVLCQNLE 

421 *GPEEVRRDGREMFYVKILN 

13 21 ACAAATGACACCTTAAAAAAGACCCCTTGGTTTCAGCTGAATTAGCCCGGAAAGATGCTC 

TGTTTACTGTGGAATTTTTTCTGGGGAACCAAAGTCGACTTAATCGGGCCTTTCTACGAG 

441 TNDTLKKTPWFQLM* PGKML 

441 QMTP*KRPLGFS*ISPERCS 
44 1 K*HLKKDPLVSAELARKDAR 

13 81 GGCACGCCATGAGTCTAGCCCACTCAGTGGGCACCCGTTTCCCACATTTGAAGTGGCCGA 
CCGTGCGGTACTCAGATCGGGTGAGTCACCCGTGGGCAAAGGGTGTAAACTTCACCGGCT 

461 GTP*V*PTQWAPVSHI*SGR 
461 ARHESSPLSGHPFPTFEVAD 
461 HAMSLAHSVGTRFPHLKWPT 

1441 CGCTTATTTGGCTGAGGCTGTGGCCTGGAAAGGCACTATGGCGTGCTGCGGTACAAGGCC 
GCGAATAAACCGACTCCGACACCGGACCTTTCCGTGATACCGCACGACGCCATGTTCCGG 

481 RLFG*GCGLERHYGVLRYKA 
48 1 AYLAEAVAWKGTMACCGTRP 
481 L IWLRLWPGKALWRAAVQGR 
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GGGGCTGGCGTACGAACCACGACGCCCGAAGGGAACTCTTCGGTCTTACTACTACTATGT 
CCCCGACCGCATGCTTGGTGCTGCGGGCTTCCCTTGAGAAGCCAGAATGATGATGATACA 

GAGVRTTTPEGNS SVLLLLC 
GLAYEPRRPKGTLRSYYYYV 
GWRTNHDARRELFGLTTTMS 



1561 



CCCCAGTTGACCCCCCGA 
GGGGTCAACTGGGGGGCT 



521 
521 
521 



P Q L T P R 
P S * P P 
P V D P P 



FIG. 15F 



